Towards Interactive Object Recognition
Abstract
I. INTRODUCTION
Object recognition is a key component of service robots that must find and handle objects. Current state-of-the-art object recognition systems recognize objects from static images [7, 8]. However, these systems are limited when objects appear in ambiguous orientations or when distinctive features are hidden, e.g., due to the pose of the object. A popular approach to this problem is active perception [1, 3], where the robot intelligently moves its camera to reveal more information about the scene. However, this approach still fails when distinctive features remain hidden, for example, on the bottom side of the object (see Fig. 1). These cases are particularly common in cluttered environments, where features may be occluded not only by the pose of the object but also by other items in the scene.
Recent work in interactive perception has shown that physically interacting with the scene opens up new ways to tackle common perception problems. This paper addresses two challenges in an information-theoretic way to improve interactive perception methods: selecting an object in a cluttered scene for manipulation, and picking the optimal movement of this object.
Interacting with a scene to improve perception by revealing informative surfaces has been explored particularly in the area of segmentation. Examples include interactive segmentation of rigid objects being moved by a robot [5], segmentation of articulated objects [4], and disambiguation of segmentation hypotheses [2]. However, none of these approaches reasons about which actions to take in order to achieve the goal. In this work we introduce a probabilistic method for choosing object manipulation actions that optimally reveal information about objects in a scene based on the robot's observations. To the best of our knowledge, the problem of interactive object recognition has not been addressed before.
Our approach determines the optimal action for a robot to interact with objects and adjust their pose so that discriminative features for determining their identity are revealed. In the ambiguous book example (see Fig. 1), this means flipping the book over and observing the cover, which results in more confident recognition. Our method is based on a probabilistic graphical model for feature-based object and pose recognition. By inferring posterior distributions over object identities conditioned on all previous actions and observations, our approach enables a robot to select the action that best reduces the uncertainty about the object. The key contributions of this approach are: (a) it presents
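The idea of choosing an action by its expected reduction in uncertainty can be illustrated with a minimal sketch. This is not the paper's graphical model: it assumes a small discrete set of object hypotheses, a hand-written observation likelihood table, and Shannon entropy as the uncertainty measure. All names (`belief`, `likelihood`, `best_action`, the book labels, the `look` and `flip` actions) are hypothetical and chosen only to mirror the ambiguous book example.

```python
import math

def entropy(belief):
    """Shannon entropy in bits of a discrete distribution {label: prob}."""
    return -sum(p * math.log2(p) for p in belief.values() if p > 0.0)

def posterior(belief, likelihood, action, observation):
    """Bayes update of the object belief for one (action, observation) pair.

    Returns (posterior, P(observation | action)); posterior is None when the
    observation has zero probability under the current belief.
    """
    unnorm = {obj: p * likelihood[action][obj][observation]
              for obj, p in belief.items()}
    total = sum(unnorm.values())
    if total == 0.0:
        return None, 0.0
    return {obj: w / total for obj, w in unnorm.items()}, total

def expected_entropy(belief, likelihood, action):
    """Expected posterior entropy over all observations the action can yield."""
    observations = next(iter(likelihood[action].values())).keys()
    result = 0.0
    for z in observations:
        post, p_z = posterior(belief, likelihood, action, z)
        if post is not None:
            result += p_z * entropy(post)
    return result

def best_action(belief, likelihood):
    """Pick the action with the lowest expected posterior entropy."""
    return min(likelihood, key=lambda a: expected_entropy(belief, likelihood, a))

# Hypothetical toy problem: two books that look identical from the current view.
belief = {"book_A": 0.5, "book_B": 0.5}

# likelihood[action][object][observation] = P(observation | object, action).
# These numbers are invented for illustration only.
likelihood = {
    "look": {  # keep the pose: both books show the same plain side
        "book_A": {"plain": 1.0},
        "book_B": {"plain": 1.0},
    },
    "flip": {  # turn the book over: the cover is usually recognized correctly
        "book_A": {"cover_A": 0.9, "cover_B": 0.1},
        "book_B": {"cover_A": 0.1, "cover_B": 0.9},
    },
}

print(best_action(belief, likelihood))  # flip
```

In this toy setting, looking again leaves the belief at one bit of entropy, while flipping is expected to lower it, so the flip is selected. A model closer to the one described above would also track object pose and condition on the full history of actions and observations.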
Similar resources
Interactive Museum Guide: Accurate Retrieval of Object Descriptions
In this paper we describe an interactive guide that is able to automatically retrieve information about objects on display in museums. A visitor can point this mobile device at exhibits and automatically retrieve descriptions about objects of interest in a non-distractive way. We investigate Gaussian image intensity attenuation and a foveation-based preprocessing approach which both allow to fo...
Urban Vegetation Recognition Based on the Decision Level Fusion of Hyperspectral and Lidar Data
Introduction: Information about vegetation cover and their health has always been interesting to ecologists due to its importance in terms of habitat, energy production and other important characteristics of plants on the earth planet. Nowadays, developments in remote sensing technologies caused more remotely sensed data accessible to researchers. The combination of these data improves the obje...
Exploring Biologically-Inspired Interactive Networks for Object Recognition
This thesis deals with biologically-inspired interactive neural networks used for the task of object recognition. Such networks offer an interesting alternative approach to traditional image processing techniques. Although the networks are very powerful classification tools, they are difficult to handle due to their bidirectional interactivity. This is one of the main reasons why these networks...
SnapLink: Interactive Object Registration and Recognition for Augmented Desk Interface
Identification of objects in a real world plays a key role for human-computer interaction in a computer-augmented environment using augmented reality techniques. To provide natural and intuitive interaction in such environments, it is necessary for an interface system to know which objects a user is using. In previously developed interface systems, real objects are identified by using specially...
Application of Combined Local Object Based Features and Cluster Fusion for the Behaviors Recognition and Detection of Abnormal Behaviors
In this paper, we propose a novel framework for behaviors recognition and detection of certain types of abnormal behaviors, capable of achieving high detection rates on a variety of real-life scenes. The new proposed approach here is a combination of the location based methods and the object based ones. First, a novel approach is formulated to use optical flow and binary motion video as the loc...
Publication date: 2014